Dataset statistics
| Number of variables | 7 |
|---|---|
| Number of observations | 845552 |
| Missing cells | 1297165 |
| Missing cells (%) | 21.9% |
| Duplicate rows | 185 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 45.2 MiB |
| Average record size in memory | 56.0 B |
Variable types
| Categorical | 1 |
|---|---|
| Text | 5 |
| Numeric | 1 |
| Dataset has 185 (< 0.1%) duplicate rows | Duplicates |
CATEGORY_1 is highly imbalanced (77.8%) | Imbalance |
CATEGORY_3 has 60566 (7.2%) missing values | Missing |
CATEGORY_4 has 778093 (92.0%) missing values | Missing |
MANUFACTURER has 226474 (26.8%) missing values | Missing |
BRAND has 226472 (26.8%) missing values | Missing |
Reproduction
| Analysis started | 2025-03-09 21:30:45.493950 |
|---|---|
| Analysis finished | 2025-03-09 21:31:01.870927 |
| Duration | 16.38 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
CATEGORY_1
Categorical
IMBALANCE 
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 111 |
| Missing (%) | < 0.1% |
| Memory size | 6.5 MiB |
| Health & Wellness | |
|---|---|
| Snacks | |
| Beverages | 3990 |
| Pantry | 871 |
| Apparel & Accessories | 846 |
| Other values (22) | 2222 |
Length
| Max length | 22 |
|---|---|
| Median length | 17 |
| Mean length | 12.707907 |
| Min length | 5 |
Characters and Unicode
| Total characters | 10743786 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Health & Wellness |
|---|---|
| 2nd row | Snacks |
| 3rd row | Health & Wellness |
| 4th row | Health & Wellness |
| 5th row | Health & Wellness |
Common Values
| Value | Count | Frequency (%) |
| Health & Wellness | 512695 | |
| Snacks | 324817 | |
| Beverages | 3990 | 0.5% |
| Pantry | 871 | 0.1% |
| Apparel & Accessories | 846 | 0.1% |
| Dairy | 602 | 0.1% |
| Needs Review | 547 | 0.1% |
| Alcohol | 503 | 0.1% |
| Home & Garden | 115 | < 0.1% |
| Restaurant | 69 | < 0.1% |
| Other values (17) | 386 | < 0.1% |
| (Missing) | 111 | < 0.1% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 513877 | ||
| health | 512695 | |
| wellness | 512695 | |
| snacks | 324817 | |
| beverages | 3990 | 0.2% |
| pantry | 871 | < 0.1% |
| apparel | 846 | < 0.1% |
| accessories | 846 | < 0.1% |
| dairy | 602 | < 0.1% |
| review | 547 | < 0.1% |
| Other values (33) | 2043 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1555587 | |
| l | 1540142 | |
| s | 1357553 | |
| 1028388 | ||
| a | 844307 | |
| n | 838718 | |
| & | 513877 | 4.8% |
| t | 513857 | 4.8% |
| h | 513270 | 4.8% |
| H | 512834 | 4.8% |
| Other values (32) | 1525253 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10743786 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1555587 | |
| l | 1540142 | |
| s | 1357553 | |
| 1028388 | ||
| a | 844307 | |
| n | 838718 | |
| & | 513877 | 4.8% |
| t | 513857 | 4.8% |
| h | 513270 | 4.8% |
| H | 512834 | 4.8% |
| Other values (32) | 1525253 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10743786 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1555587 | |
| l | 1540142 | |
| s | 1357553 | |
| 1028388 | ||
| a | 844307 | |
| n | 838718 | |
| & | 513877 | 4.8% |
| t | 513857 | 4.8% |
| h | 513270 | 4.8% |
| H | 512834 | 4.8% |
| Other values (32) | 1525253 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10743786 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1555587 | |
| l | 1540142 | |
| s | 1357553 | |
| 1028388 | ||
| a | 844307 | |
| n | 838718 | |
| & | 513877 | 4.8% |
| t | 513857 | 4.8% |
| h | 513270 | 4.8% |
| H | 512834 | 4.8% |
| Other values (32) | 1525253 |
CATEGORY_2
Text
| Distinct | 121 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1424 |
| Missing (%) | 0.2% |
| Memory size | 6.5 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 35 |
| Mean length | 12.075909 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10193613 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sexual Health |
|---|---|
| 2nd row | Puffed Snacks |
| 3rd row | Hair Care |
| 4th row | Oral Care |
| 5th row | Medicines & Treatments |
| Value | Count | Frequency (%) |
| 301082 | ||
| care | 246440 | 13.0% |
| hair | 125081 | 6.6% |
| candy | 121036 | 6.4% |
| treatments | 117718 | 6.2% |
| medicines | 99118 | 5.2% |
| bath | 81469 | 4.3% |
| body | 81469 | 4.3% |
| skin | 62587 | 3.3% |
| nuts | 33522 | 1.8% |
| Other values (182) | 625771 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1110136 | 10.9% |
| 1051165 | 10.3% | |
| a | 976905 | 9.6% |
| i | 676575 | 6.6% |
| r | 671853 | 6.6% |
| n | 573186 | 5.6% |
| s | 534621 | 5.2% |
| t | 522516 | 5.1% |
| C | 449030 | 4.4% |
| d | 418113 | 4.1% |
| Other values (40) | 3209513 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10193613 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1110136 | 10.9% |
| 1051165 | 10.3% | |
| a | 976905 | 9.6% |
| i | 676575 | 6.6% |
| r | 671853 | 6.6% |
| n | 573186 | 5.6% |
| s | 534621 | 5.2% |
| t | 522516 | 5.1% |
| C | 449030 | 4.4% |
| d | 418113 | 4.1% |
| Other values (40) | 3209513 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10193613 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1110136 | 10.9% |
| 1051165 | 10.3% | |
| a | 976905 | 9.6% |
| i | 676575 | 6.6% |
| r | 671853 | 6.6% |
| n | 573186 | 5.6% |
| s | 534621 | 5.2% |
| t | 522516 | 5.1% |
| C | 449030 | 4.4% |
| d | 418113 | 4.1% |
| Other values (40) | 3209513 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10193613 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1110136 | 10.9% |
| 1051165 | 10.3% | |
| a | 976905 | 9.6% |
| i | 676575 | 6.6% |
| r | 671853 | 6.6% |
| n | 573186 | 5.6% |
| s | 534621 | 5.2% |
| t | 522516 | 5.1% |
| C | 449030 | 4.4% |
| d | 418113 | 4.1% |
| Other values (40) | 3209513 |
CATEGORY_3
Text
MISSING 
| Distinct | 344 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 60566 |
| Missing (%) | 7.2% |
| Memory size | 6.5 MiB |
Length
| Max length | 41 |
|---|---|
| Median length | 33 |
| Mean length | 17.102599 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13425301 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 56 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Conductivity Gels & Lotions |
|---|---|
| 2nd row | Cheese Curls & Puffs |
| 3rd row | Hair Care Accessories |
| 4th row | Toothpaste |
| 5th row | Essential Oils |
| Value | Count | Frequency (%) |
| 260403 | 12.5% | |
| candy | 107888 | 5.2% |
| hair | 71523 | 3.4% |
| treatments | 57832 | 2.8% |
| confection | 56965 | 2.7% |
| supplements | 55700 | 2.7% |
| herbal | 55700 | 2.7% |
| vitamins | 55700 | 2.7% |
| chocolate | 47710 | 2.3% |
| care | 44360 | 2.1% |
| Other values (479) | 1265194 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1293989 | 9.6% | |
| e | 1150071 | 8.6% |
| a | 1047002 | 7.8% |
| n | 907269 | 6.8% |
| s | 876002 | 6.5% |
| i | 866388 | 6.5% |
| o | 790373 | 5.9% |
| t | 773408 | 5.8% |
| r | 700702 | 5.2% |
| l | 446232 | 3.3% |
| Other values (44) | 4573865 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 13425301 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1293989 | 9.6% | |
| e | 1150071 | 8.6% |
| a | 1047002 | 7.8% |
| n | 907269 | 6.8% |
| s | 876002 | 6.5% |
| i | 866388 | 6.5% |
| o | 790373 | 5.9% |
| t | 773408 | 5.8% |
| r | 700702 | 5.2% |
| l | 446232 | 3.3% |
| Other values (44) | 4573865 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 13425301 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1293989 | 9.6% | |
| e | 1150071 | 8.6% |
| a | 1047002 | 7.8% |
| n | 907269 | 6.8% |
| s | 876002 | 6.5% |
| i | 866388 | 6.5% |
| o | 790373 | 5.9% |
| t | 773408 | 5.8% |
| r | 700702 | 5.2% |
| l | 446232 | 3.3% |
| Other values (44) | 4573865 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 13425301 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1293989 | 9.6% | |
| e | 1150071 | 8.6% |
| a | 1047002 | 7.8% |
| n | 907269 | 6.8% |
| s | 876002 | 6.5% |
| i | 866388 | 6.5% |
| o | 790373 | 5.9% |
| t | 773408 | 5.8% |
| r | 700702 | 5.2% |
| l | 446232 | 3.3% |
| Other values (44) | 4573865 |
CATEGORY_4
Text
MISSING 
| Distinct | 127 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 778093 |
| Missing (%) | 92.0% |
| Memory size | 6.5 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 36 |
| Mean length | 20.771402 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1401218 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 22 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Hair Brushes & Combs |
|---|---|
| 2nd row | Women's Shaving Gel & Cream |
| 3rd row | Lip Balms |
| 4th row | Already Popped Popcorn |
| 5th row | Women's Shaving Gel & Cream |
| Value | Count | Frequency (%) |
| 31106 | 14.1% | |
| treatments | 14079 | 6.4% |
| medicines | 12755 | 5.8% |
| lip | 11063 | 5.0% |
| popcorn | 10719 | 4.9% |
| balms | 9737 | 4.4% |
| hair | 7892 | 3.6% |
| already | 6974 | 3.2% |
| popped | 6974 | 3.2% |
| women's | 6170 | 2.8% |
| Other values (200) | 103029 |
Most occurring characters
| Value | Count | Frequency (%) |
| 153039 | 10.9% | |
| e | 146700 | 10.5% |
| s | 96052 | 6.9% |
| i | 84418 | 6.0% |
| n | 84302 | 6.0% |
| a | 82621 | 5.9% |
| r | 82474 | 5.9% |
| o | 75600 | 5.4% |
| t | 56913 | 4.1% |
| p | 50709 | 3.6% |
| Other values (43) | 488390 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1401218 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 153039 | 10.9% | |
| e | 146700 | 10.5% |
| s | 96052 | 6.9% |
| i | 84418 | 6.0% |
| n | 84302 | 6.0% |
| a | 82621 | 5.9% |
| r | 82474 | 5.9% |
| o | 75600 | 5.4% |
| t | 56913 | 4.1% |
| p | 50709 | 3.6% |
| Other values (43) | 488390 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1401218 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 153039 | 10.9% | |
| e | 146700 | 10.5% |
| s | 96052 | 6.9% |
| i | 84418 | 6.0% |
| n | 84302 | 6.0% |
| a | 82621 | 5.9% |
| r | 82474 | 5.9% |
| o | 75600 | 5.4% |
| t | 56913 | 4.1% |
| p | 50709 | 3.6% |
| Other values (43) | 488390 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1401218 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 153039 | 10.9% | |
| e | 146700 | 10.5% |
| s | 96052 | 6.9% |
| i | 84418 | 6.0% |
| n | 84302 | 6.0% |
| a | 82621 | 5.9% |
| r | 82474 | 5.9% |
| o | 75600 | 5.4% |
| t | 56913 | 4.1% |
| p | 50709 | 3.6% |
| Other values (43) | 488390 |
MANUFACTURER
Text
MISSING 
| Distinct | 4354 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 226474 |
| Missing (%) | 26.8% |
| Memory size | 6.5 MiB |
Length
| Max length | 54 |
|---|---|
| Median length | 41 |
| Mean length | 17.130166 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10604909 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 553 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | PLACEHOLDER MANUFACTURER |
|---|---|
| 2nd row | COLGATE-PALMOLIVE |
| 3rd row | MAPLE HOLISTICS AND HONEYDEW PRODUCTS INTERCHANGEABLY. |
| 4th row | PLACEHOLDER MANUFACTURER |
| 5th row | HALEON |
| Value | Count | Frequency (%) |
| manufacturer | 108900 | 7.3% |
| inc | 87087 | 5.8% |
| placeholder | 86902 | 5.8% |
| 49150 | 3.3% | |
| llc | 48309 | 3.2% |
| company | 47379 | 3.2% |
| the | 27173 | 1.8% |
| johnson | 23485 | 1.6% |
| foods | 22006 | 1.5% |
| procter | 21065 | 1.4% |
| Other values (5168) | 974130 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1032996 | 9.7% |
| A | 904316 | 8.5% |
| 876508 | 8.3% | |
| R | 869407 | 8.2% |
| L | 706251 | 6.7% |
| O | 684769 | 6.5% |
| N | 680588 | 6.4% |
| C | 672156 | 6.3% |
| T | 482682 | 4.6% |
| I | 468787 | 4.4% |
| Other values (43) | 3226449 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10604909 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 1032996 | 9.7% |
| A | 904316 | 8.5% |
| 876508 | 8.3% | |
| R | 869407 | 8.2% |
| L | 706251 | 6.7% |
| O | 684769 | 6.5% |
| N | 680588 | 6.4% |
| C | 672156 | 6.3% |
| T | 482682 | 4.6% |
| I | 468787 | 4.4% |
| Other values (43) | 3226449 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10604909 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 1032996 | 9.7% |
| A | 904316 | 8.5% |
| 876508 | 8.3% | |
| R | 869407 | 8.2% |
| L | 706251 | 6.7% |
| O | 684769 | 6.5% |
| N | 680588 | 6.4% |
| C | 672156 | 6.3% |
| T | 482682 | 4.6% |
| I | 468787 | 4.4% |
| Other values (43) | 3226449 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10604909 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 1032996 | 9.7% |
| A | 904316 | 8.5% |
| 876508 | 8.3% | |
| R | 869407 | 8.2% |
| L | 706251 | 6.7% |
| O | 684769 | 6.5% |
| N | 680588 | 6.4% |
| C | 672156 | 6.3% |
| T | 482682 | 4.6% |
| I | 468787 | 4.4% |
| Other values (43) | 3226449 |
BRAND
Text
MISSING 
| Distinct | 8122 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 226472 |
| Missing (%) | 26.8% |
| Memory size | 6.5 MiB |
Length
| Max length | 42 |
|---|---|
| Median length | 36 |
| Mean length | 9.9704287 |
| Min length | 2 |
Characters and Unicode
| Total characters | 6172493 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 1112 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | ELECSOP |
|---|---|
| 2nd row | COLGATE |
| 3rd row | MAPLE HOLISTICS |
| 4th row | BEAUHAIR |
| 5th row | EMERGEN-C |
| Value | Count | Frequency (%) |
| brand | 44190 | 4.3% |
| rem | 20813 | 2.0% |
| not | 17366 | 1.7% |
| known | 17025 | 1.7% |
| private | 13743 | 1.3% |
| label | 13468 | 1.3% |
| 8551 | 0.8% | |
| hair | 7961 | 0.8% |
| care | 7378 | 0.7% |
| cvs | 6400 | 0.6% |
| Other values (8213) | 864054 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 663544 | 10.8% |
| A | 543564 | 8.8% |
| R | 496839 | 8.0% |
| 401869 | 6.5% | |
| S | 395096 | 6.4% |
| N | 391809 | 6.3% |
| O | 381562 | 6.2% |
| I | 351955 | 5.7% |
| T | 330375 | 5.4% |
| L | 315642 | 5.1% |
| Other values (55) | 1900238 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6172493 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 663544 | 10.8% |
| A | 543564 | 8.8% |
| R | 496839 | 8.0% |
| 401869 | 6.5% | |
| S | 395096 | 6.4% |
| N | 391809 | 6.3% |
| O | 381562 | 6.2% |
| I | 351955 | 5.7% |
| T | 330375 | 5.4% |
| L | 315642 | 5.1% |
| Other values (55) | 1900238 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6172493 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 663544 | 10.8% |
| A | 543564 | 8.8% |
| R | 496839 | 8.0% |
| 401869 | 6.5% | |
| S | 395096 | 6.4% |
| N | 391809 | 6.3% |
| O | 381562 | 6.2% |
| I | 351955 | 5.7% |
| T | 330375 | 5.4% |
| L | 315642 | 5.1% |
| Other values (55) | 1900238 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6172493 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 663544 | 10.8% |
| A | 543564 | 8.8% |
| R | 496839 | 8.0% |
| 401869 | 6.5% | |
| S | 395096 | 6.4% |
| N | 391809 | 6.3% |
| O | 381562 | 6.2% |
| I | 351955 | 5.7% |
| T | 330375 | 5.4% |
| L | 315642 | 5.1% |
| Other values (55) | 1900238 |
BARCODE
Real number (ℝ)
| Distinct | 841342 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 4025 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.0161091 × 1011 |
| Minimum | 185 |
|---|---|
| Maximum | 6.2911082 × 1013 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.5 MiB |
Quantile statistics
| Minimum | 185 |
|---|---|
| 5-th percentile | 1.7082884 × 1010 |
| Q1 | 7.124923 × 1010 |
| median | 6.344185 × 1011 |
| Q3 | 7.683955 × 1011 |
| 95-th percentile | 8.91164 × 1011 |
| Maximum | 6.2911082 × 1013 |
| Range | 6.2911082 × 1013 |
| Interquartile range (IQR) | 6.9714627 × 1011 |
Descriptive statistics
| Standard deviation | 1.0225297 × 1012 |
|---|---|
| Coefficient of variation (CV) | 1.6996529 |
| Kurtosis | 81.861678 |
| Mean | 6.0161091 × 1011 |
| Median Absolute Deviation (MAD) | 2.5520957 × 1011 |
| Skewness | 6.1811281 |
| Sum | 5.0627182 × 1017 |
| Variance | 1.045567 × 1024 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11461821 | 2 | < 0.1% |
| 20146900 | 2 | < 0.1% |
| 3454206 | 2 | < 0.1% |
| 3462003 | 2 | < 0.1% |
| 3422007 | 2 | < 0.1% |
| 906425 | 2 | < 0.1% |
| 3451304 | 2 | < 0.1% |
| 50426171 | 2 | < 0.1% |
| 3423905 | 2 | < 0.1% |
| 3416105 | 2 | < 0.1% |
| Other values (841332) | 841507 | |
| (Missing) | 4025 | 0.5% |
| Value | Count | Frequency (%) |
| 185 | 1 | |
| 3582 | 1 | |
| 4091 | 1 | |
| 5579 | 1 | |
| 5777 | 1 | |
| 5784 | 1 | |
| 6163 | 1 | |
| 6910 | 1 | |
| 9034 | 1 | |
| 10498 | 1 |
| Value | Count | Frequency (%) |
| 6.291108161 × 1013 | 1 | |
| 6.291100733 × 1013 | 1 | |
| 5.4114457 × 1013 | 1 | |
| 5.010724528 × 1013 | 1 | |
| 1.0895178 × 1013 | 1 | |
| 1.0857602 × 1013 | 1 | |
| 1.085748401 × 1013 | 1 | |
| 1.085748401 × 1013 | 1 | |
| 1.085748401 × 1013 | 1 | |
| 1.077098104 × 1013 | 1 |
| BARCODE | CATEGORY_1 | |
|---|---|---|
| BARCODE | 1.000 | 0.021 |
| CATEGORY_1 | 0.021 | 1.000 |
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
| CATEGORY_1 | CATEGORY_2 | CATEGORY_3 | CATEGORY_4 | MANUFACTURER | BRAND | BARCODE | |
|---|---|---|---|---|---|---|---|
| 0 | Health & Wellness | Sexual Health | Conductivity Gels & Lotions | NaN | NaN | NaN | 7.964944e+11 |
| 1 | Snacks | Puffed Snacks | Cheese Curls & Puffs | NaN | NaN | NaN | 2.327801e+10 |
| 2 | Health & Wellness | Hair Care | Hair Care Accessories | NaN | PLACEHOLDER MANUFACTURER | ELECSOP | 4.618178e+11 |
| 3 | Health & Wellness | Oral Care | Toothpaste | NaN | COLGATE-PALMOLIVE | COLGATE | 3.500047e+10 |
| 4 | Health & Wellness | Medicines & Treatments | Essential Oils | NaN | MAPLE HOLISTICS AND HONEYDEW PRODUCTS INTERCHANGEABLY. | MAPLE HOLISTICS | 8.068109e+11 |
| 5 | Health & Wellness | Hair Care | Hair Care Accessories | NaN | PLACEHOLDER MANUFACTURER | BEAUHAIR | 6.626585e+11 |
| 6 | Health & Wellness | Medicines & Treatments | Vitamins & Herbal Supplements | NaN | HALEON | EMERGEN-C | 6.177376e+11 |
| 7 | Health & Wellness | Deodorant & Antiperspirant | Men's Deodorant & Antiperspirant | NaN | NaN | NaN | 7.501839e+12 |
| 8 | Snacks | Snack Bars | Granola Bars | NaN | HYVEE INC | HY-VEE | 7.545013e+10 |
| 9 | Health & Wellness | NaN | NaN | NaN | CHURCH & DWIGHT | REPHRESH | NaN |
| CATEGORY_1 | CATEGORY_2 | CATEGORY_3 | CATEGORY_4 | MANUFACTURER | BRAND | BARCODE | |
|---|---|---|---|---|---|---|---|
| 845542 | Health & Wellness | Hair Care | Hair Color | NaN | L'OREAL | REDKEN | 8.844864e+11 |
| 845543 | Health & Wellness | Skin Care | Facial Cleansers | NaN | NORVELL SKIN SOLUTIONS, LLC | NORVELL | 8.125120e+11 |
| 845544 | Snacks | Pudding & Gelatin | Ready-to-Eat Pudding | NaN | PLACEHOLDER MANUFACTURER | BRAND NOT KNOWN | 4.427603e+10 |
| 845545 | Snacks | Nuts & Seeds | Covered Nuts | NaN | NaN | NaN | 7.729091e+10 |
| 845546 | Health & Wellness | Bath & Body | Liquid Hand Soap | NaN | NaN | NaN | 7.545029e+10 |
| 845547 | Health & Wellness | Topical Muscle & Joint Relief Treatments | Braces & Wraps | NaN | NaN | NaN | 7.223016e+11 |
| 845548 | Snacks | Cookies | NaN | NaN | TREEHOUSE FOODS, INC. | LOFTHOUSE | 4.182082e+10 |
| 845549 | Snacks | Candy | Confection Candy | NaN | HARIBO GMBH & CO KG | HARIBO | 1.001672e+11 |
| 845550 | Snacks | Nuts & Seeds | Hazelnuts | NaN | DOUBLE-COLA CO | JUMBO | 7.539076e+10 |
| 845551 | Health & Wellness | First Aid | First Aid Kits | NaN | 3M | NEXCARE | 7.967933e+11 |
Most frequently occurring
| CATEGORY_1 | CATEGORY_2 | CATEGORY_3 | CATEGORY_4 | MANUFACTURER | BRAND | BARCODE | # duplicates | |
|---|---|---|---|---|---|---|---|---|
| 13 | Restaurant | Beverages | Soda | NaN | THE COCA-COLA COMPANY | COCA-COLA | NaN | 19 |
| 12 | Restaurant | Beverages | Soda | NaN | PEPSICO | PEPSI | NaN | 5 |
| 10 | Restaurant | Beverages | Slushies & Icees | NaN | THE COCA-COLA COMPANY | COCA-COLA | NaN | 4 |
| 11 | Restaurant | Beverages | Soda | Diet Soda | PEPSICO | PEPSI | NaN | 4 |
| 2 | Health & Wellness | Medicines & Treatments | Allergy & Sinus Medicines & Treatments | NaN | HALEON | FLONASE | NaN | 3 |
| 3 | Health & Wellness | Medicines & Treatments | Vitamins & Herbal Supplements | NaN | HALEON | EMERGEN-C | NaN | 3 |
| 152 | Snacks | Chips | Crisps | NaN | KELLANOVA | PRINGLES | NaN | 3 |
| 174 | Snacks | Puffed Snacks | Cheese Curls & Puffs | NaN | THE HERSHEY COMPANY | PIRATE'S BOOTY | NaN | 3 |
| 175 | Snacks | Puffed Snacks | Popcorn | NaN | THE HERSHEY COMPANY | SKINNYPOP | NaN | 3 |
| 176 | Snacks | Snack Cakes | Brownie Snack Cakes | NaN | BIMBO | ENTENMANN'S SWEET BAKED GOODS | NaN | 3 |